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DESCRIPTION 



SAMPLING RATE CONVERSION APPARATUS, CODING APPARATUS, 
DECODING APPARATUS AND METHODS THEREOF 

Technical Field 

[0001] The present invention relates to a sampling rate 
conversion apparatus, coding apparatus, decoding 
apparatus and methods thereof. 



Background Art 

[0002 ] Nowadays, there are many different sampling rates 
such as 44.1 kHz for a compact disk, 32 kHz or 48 kHz 
for DAT (Digital Audio Tape), digital VCR or satellite 

15 television, 48 kHz or 96 kHz for a DVD audio signal. 
Therefore, when an internal sampling rate of a decoder 
of a reproduction apparatus or a recording apparatus is 
different from the sampling rate of data to be decoded, 
it is necessary to change the sampling rate. One such 

20 conventional apparatus that converts this sampling rate 
is described, for example, in Patent Document 1. 
[0003] Also, in recent years, transmission path 
capacities on a network have been significantly improved 
with the popularity of ADSL (Asymmetric Digital 

25 Subscriber Line) and optical fibers in a wired system, 
practical use of W-CDMA (Wideband - Code Division Multiple 
Access) and wireless LAN in a wireless system or the like, * 
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and in line with this trend, there are demands for 
realization of high sense of realism and high quality 
by expanding bandwidth of signal in voice communications . 
[0004 ] At present, there are G.72 6, 729 or the like which 
5 are standardized by ITU (International Telecommunication 
Union) as typical schemes for coding a narrow band signal. 
Furthermore, examples of typical methods for coding a 
wideband signal include G722, G722.1 of ITU-T 
(International Telecommunication Union 

10 Telecommunication Standardization Sector) and AMR- WB or 
the like of 3GPP (The 3rd Generation Partnership Project) . 
[0005] Moreover, with the intention of being used in 
various network environments such as an IP (Internet 
Protocol) network, the voice coding scheme is recently 

15 required to realize a scalable function. The scalable 
function means the function capable of decoding a voice 
signal even from part of a code. With this scalable 
function, it is possible to reduce the occurrence 
frequency of packet loss by decoding a high quality voice 

20 signal using all codes in a communication path under good 
conditions and transmitting only part of the code in a 
communication path under bad conditions . 

It is also possible to produce effects such as an increase 
in efficiency of network resources in multicast 
25 communication. 

[0006] To realize a high quality coding scheme having 
this scalable function, coding must be performed using 
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signals at various sampling rates. For example, if a 
signal having a sampling rate of 8 kHz is coded using 
a method such as G.726, G.729 or the like standardized 
in ITU-T and its error signal is further coded in an area 
5 of sampling rate of 16 kHz, it is possible to improve 
quality, through an extension of the signal bandwidth and 
realize scalability. 

[0007] FIG.l is a block diagram showing the typical 
configuration of a coding apparatus that performs 
10 scalable coding. In this example, the number of layers 
is N=3 and the sampling rate of a signal layer n is 
represented FS(n) and suppose FS ( 1 ) =1 6 [ kHz ] , 
FS (2) = 24 [kHz] and FS ( 3 ) = 3 2 [ kH z ] . 

[0008] An acoustic signal (voice signal, audio signal 
15 or the like) input to downsampling section 12 through 
input terminal 11 is downsampled froma sampling frequency 
of 32 kHz to 16 kHz and given to first layer coding section 
13. First layer coding section 13 determines a first 
code so that perceptual distortion between the input 
20 acoustic signal and the decoded signal which is generated 
after the coding becomes a minimum. This first code is 
sent to multiplexing section 26 and also sent to first 
layer decoding section 14. First layer decoding section 
14 generates a first layer decoded signal using the first 
25 code. Upsampling section 15 performs upsampling on the 
sampling frequency of the first layer decoded signal from 
16 kHz to 24 kHz and gives the upsampled signal to 
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subtractor 18 and adder 21. 

[0009] Furthermore, an acoustic signal input to 
downsampling section 16 through input terminal 11 is 
downsampled from a sampling frequency of 32 kHz to 24 
5 kHz and given to delay section 17. Delay section 17 delays 
the downsampled signal by a predetermined duration. 
Subtractor" 18 calculates the difference between the 
output signal of delay section 17 and the output signal 
of upsampling section 15, generates a second layer 

10 residual signal and gives it to second layer coding section 
19. Second layer coding section 19 performs coding so 
that the perceptual quality of the second layer residual 
signal is improved, determines a second code and gives 
this second code to multiplexing section 26 and second 

15 layer decoding section 20. Second layer decoding section 
20 performs decoding processing using the second code 
and generates a second layer decoded residual signal. 
Adder 21 calculates the sum between above described first 
layer decoded signal and the second layer decoded residual 

20 signal and generates a second layer decoded signal. 
Upsampling section 22 performs upsampling on the sampling 
frequency of the second layer decoded signal from 24 kHz 
to 32 kHz and gives this signal to subtractor 24. 
[0010] Moreover, an acoustic signal input to delay 

25 section 23 through input terminal 11 is delayed by a 
predetermined duration and given to subtractor 24. 
Subtractor 24 calculates the difference between the 



4 




output signal of delay section 23 and the output signal 
of upsampling section 22 and generates a third layer 
residual signal. This third layer residual signal is 
given to third layer coding section 25. Third layer 
5 coding section 25 performs coding on the third layer 
residual signal so that its perceptual quality is improved, 
determines a third code and gives the code to multiplexing 
section 26. Multiplexing section 26 multiplexes the 
codes obtained from first layer coding section 13, second 
10 layer coding section 19 and third layer coding section 
25 and outputs the multiplexing result through output 
terminal 2 7 . 

Patent Document 1 : Unexamined Japanese Patent Publication 
15 No . 2000-68948 

Disclosure of Invention 

Problems to be Solved by the Invention 

[0011] However, as mentioned above, the coding apparatus 
20 which realizes a scalable function based on a time domain 
coding scheme such as G.726, 729, AMR- WB or the like needs 
to convert sampling rates of various signals 
(downsampling section 12, upsampling section 15, 
downsampling section 16 and upsampling section 22 in the 
25 above described example) , which results in a problem that 
the configuration of the coding apparatus becomes 
complicated and the amount of coding processing 
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calculation also increases. Furthermore, the circuit 
configuration of the decoding apparatus that decodes a 
signal coded by this coding apparatus also becomes 
complicated and the amount of decoding processing 
5 calculation increases. 

[0012] It is an object of the present invention to provide 
a sampling rate conversion apparatus and coding apparatus 
that can reduce a circuit scale and also reduce the amount 
of coding processing calculation, a decoding apparatus 
10 that decodes a signal coded by this coding apparatus and 
methods for these apparatuses. 

Means for Solving the Problem 

[0013] The present invention extends an effective 
15 frequency band of a spectrum in a frequency domain instead 
of performing a sampling conversion (especially 
upsampling) in a time domain and thereby obtains a signal 
equivalent to a case where a time domain signal is 
upsampled . 

20 [0014] The sampling rate conversion apparatus of the 
present invention adopts a configuration comprising a 
conversion section that converts an input time domain 
signal to a frequency domain and obtains a first spectrum, 
an extension section that extends the frequency band of 

25 the first spectrum obtained and an insertion section that 
inserts a second spectrum in the extended frequency band 
of the first spectrum after the extension. 




[0015] According to this configuration, the input time 
domain signal is converted to a frequency domain signal 
and the frequency band of the spectrum obtained is extended, 
and it is possible to thereby obtain a signal equivalent 
5 to a signal upsampled in the time domain. Furthermore, 
it is also possible to reduce the circuit scale of the 
coding apparatus and also reduce the amount of coding 
processing calculation. 

[0016] The coding apparatus of the present invention 

10 adopts a configuration comprising a conversion section 
that performs a frequency analysis of a signal having 
an input sampling frequency of Fx with an analysis length 
of 2 • Na and obtains a first spectrum of an Na point, an 
extension section that extends the frequency band of the 

15 first spectrum obtained to an Nb point and a coding section 
that specifies a second spectrum inserted in the extended 
frequency band of the first spectrum after the extension 
and outputs a code representing this second spectrum. 
[0017] This configuration allows a spectrum having a 

20 sampling rate of FS=Fx • Nb/Na to be obtained without 
performing any sampling conversion in the time domain. 
[0018] In the coding apparatus of the present invention 
in the above described configuration, the second spectrum 
is generated based on the first spectrum. 

25 [0019] According to this configuration, it is possible 
to generate an extended spectrum based on information 
obtained by the decoder and thereby realize a low bit 
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rate . 

[0020] In the coding apparatus of the present invention 
in the above described configuration, the second spectrum 
is determined so as to resemble the spectrum included 
5 ina frequency band of Na^ k<Nb out ofthe spectrum obtained 
by the frequency analysis of the input signal having a 
sampling frequency of Fy at a 2 • Nb point. 
[0021] According to this configuration, it is possible 
to determine the extended spectrum relative to the 
10 spectrum of an original signal and thereby obtain a more 
accurate extended spectrum. 

[0022] In the coding apparatus of the present invention 
in the above described configuration, the coding section 
divides the frequency band of Na^k<Nb into two or more 
15 subbands and outputs codes representing the second 
spectrum in subband units. 

[0023] According to this configuration, it is possible 
to obtain the effect of generating a code having a 
scalable function. 
20 [0024] /In the coding apparatus of the present invention 
in the above described configuration, the signal having 
a sampling frequency of Fx is a signal decoded with a 
lower layer of hierarchical coding. 

[0025] According to this configuration, the present 
25 invention can be applied to hierarchical coding made up 
of a coding section having a plurality of layers and the 
hierarchical coding can be realized only with a minimum 
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sampling conversion . 

[0026] The decoding apparatus of the present invention 
adopts a configuration comprising an acquisition section 
that performs a frequency analysis of a signal having 
5 a sampling frequency of Fx with an analysis length of 
2 • Na and acquires a first spectrum in a frequency band 
of 0^k<Na, a decoding section that receives a code and 
decodes a second spectrum in a frequency band of Na^kXNb, 
a generation section that combines the first spectrum 
10 and the second spectrum and generates a spectrum in a 
frequency band of 0^k<Nb, and a conversion section that 
converts the spectrum included in the frequency band of 
0^k<Nb to a time domain signal. 

[0027] According to this configuration, it is possible 
15 to decode a code generated by the coding apparatus 
according to any one of the above described 
configurations . 

[0028] Inthedecoding apparatus of the present invention 
in the above described configuration adopts a 
20 configuration, the second spectrum is generated based 
on the spectrum in a frequency band of OS k<Na. 

[0029] According to this configuration, it is possible 
to decode the code using the coding method of generating 
an extended spectrum based on information obtained with 
25 the decoder and thereby realize a low bit rate. 

[0030] The decoding apparatus of the present invention 
in the above described configuration adopts a 
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configuration, further comprising a section that inserts 
a specified value into a high-frequency part of the 
spectrum after the combination or discards a 
high-frequency part of the spectrum after the combination 
so that the frequency bandwidth of the spectrum after 
the combination obtained by the generation section 
matches a predetermined bandwidth. 

[0031] According to this configuration, a decoded signal 
is generated after adding processing of making the 
bandwidth of the spectrum constant even when the bandwidth 
of the spectrum received changes due to factors such as 
a condition of a network or the like, and it is possible 
to thereby generate a decoded signal at a desired sampling 
rate stably. 

[0032] In the decoding apparatus of the present invention 
in the above described configuration, the signal having 
a sampling frequency of Fx is a signal decoded with a 
lower layer in hierarchical coding. 

[0033] According to this configuration, it is possible 
to decode a code obtained through hierarchical coding 
made up of the coding section having a plurality of layers . 

Advantageous Effect of the Invention 

[ 00 34 ] According to the present invention, it is possible 
to reduce the circuit scale of the coding apparatus and 
also reduce the amount of coding processing calculation. 
It is also possible to provide a decoding apparatus that 
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decodes a signal coded by this coding apparatus. 
Brief Description of Drawings 

[0035] FIG.l is a block diagram showing the typical 
5 configuration of a coding apparatus that performs 
scalable coding; 

FIG. 2 is a block diagram showing the main 
configuration of a spectrum coding apparatus according 
to Embodiment 1; 
10 FIG.3A shows a first spectrum and FIG.3B shows a 

spectrum after an effective frequency band is extended; 

FIG.4A illustrates the effect of processing of 
extending an effective frequency band of a spectrum 
theoretically; 

15 FIG.4B illustrates the effect of processing of 

extending an effective frequency band of a spectrum in 
principle; 

FIG. 5 is a block diagram showing the main 
configuration of a radio transmission apparatus according 
20 to Embodiment 1; 

FIG. 6 is a block diagram showing the internal 
configuration of a coding apparatus according to 
Embodiment 1; 

FIG. 7 is a block diagram showing the internal 
25 configuration of a spectrum coding section according to 
Embodiment 1; 

FIG. 8 is a block diagram showing a variation of the 
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spectrum coding section according to Embodiment 1; 

FIG. 9 is a block diagram showing the main 
configuration of a radio reception apparatus according 
to Embodiment 1; 
5 FIG. 10 is a block diagram showing the internal 

configuration of a decoding apparatus according to 
Embodiment 1; 

FIG. 11 is a block diagram showing the internal 
configuration of a spectrum decoding section according 
10 to Embodiment 1; 

FIG.12A and FIG.12B illustrate the processing 
carried out by a band extension section according to 
Embodiment 1; 

FIG. 13 illustrates how a spectrum is processed at 
15 a combining section and a time domain conversion section 
according to Embodiment 1 to generate a decoded signal; 

FIG.14A is a block diagram showing the main 
configuration on the transmitting side when the coding 
apparatus according to Embodiment 1 is applied to a wired 
20 communications system; 

FIG.14B is a block diagram showing the main 
configuration on the receiving side when the decoding 
apparatus according to Embodiment 1 is applied to a wired 
communications system; 
25 FIG. 15 is a block diagram showing the main 

configuration of a decoding apparatus according to 
Embodiment 2; 



FIG. 16 is a block diagram showing the internal 
configuration of a spectrum decoding section according 
to Embodiment 2; 

FIG. 17 illustrates processing of a correction 
5 section according to Embodiment 2 in more detail; 

FIG. 18 illustrates processing of the correction 
section according to Embodiment 2 in more detail; 

FIG. 19 further illustrates the operation of the 
spectrum decoding section according to Embodiment 2; 
10 FIG.20A further illustrates the operation of the 

spectrum decoding section according to Embodiment 2; 

FIG.20B further illustrates the operation of the 
spectrum decoding section according to Embodiment 2; 

FIG. 21 shows the main configuration of a 
15 communications system according to Embodiment 3; and 

FIG. 22 shows the main configuration of a 
communications system according to Embodiment 4 . 

Best Mode for Carrying Out the Invention 
20 [0036] Now, embodiments of the present invention will 
be described in detail with reference to the accompanying 
drawings . 

[0037] (Embodiment 1) 
25 FIG. 2 is a block diagram showing the main 

configuration of spectrum coding apparatus 100 according 
to Embodiment 1 of the present invention. 



[0038] Spectrum coding apparatus 100 according to this 
embodiment is provided with sampling rate conversion 
section 101, input terminal 102, spectral information 
specification section 106 and output terminal 107. 
5 Furthermore, sampling rate conversion section 101 has 
frequency domain conversion section 103, band extension 
section 104 and extended spectrum assignment section 105. 
[0039] A signal sampled at a sampling rate Fx is input 
to spectrum coding apparatus 100 through input terminal 
10 102. 

[0040] Frequency domain conversion section 103 converts 
a time domain signal to a frequency domain signal 
(frequency domain conversion) by performing a frequency 
analysis of this signal with an analysis length of 2 • 

15 Na and calculates first spectrum Sl(k) (0^k<Na) . Then, 
first spectrum SI (k) calculated is given to band extension 
section 104. Here, a modified discrete cosine transform 
(MDCT) is used for the frequency analysis. The MDCT is 
characterized in that an analysis frame and a successive 

20 frame are overlapped by half on top one another and analysis 
is performed, and thereby distortion between the frames 
is canceled using an orthogonal basis whereby the first 
half portion of the analysis frame becomes an odd function 
and the second half portion of the analysis frame becomes 

25 aneven function. As the method oft he frequencyanalysis, 
it is also possible to use a discrete Fourier transform 
(DFT), discrete cosine transform (DCT) or the like. 
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[0041] Band extension section 104 allocates a new area 
(frequency band) so that a new spectrum can be assigned 

to the extended area following to the frequency k = Na of 

input first spectrum Sl(k) and extends the effective 
5 frequency band of first spectrum Sl(k) to 0 ^ k<Nb . The 

processing of extending this effective frequency band 

will be explained in detail later. 

[ 0042 ] Extended spectrum assignment section 105 assigns 
extended spectrum SI 1 (k) (Na^k<Nb) input from outside to 
10 the frequency band extended by band extension section 
104 and outputs it to spectral information specification 
section 10 6. 

[0043] Spectral information specification section 106 
outputs information necessary to specify extended 

15 spectrum SI 1 (k) out of the spectrum given from extended 
spectrum assignment section 105 as the code through output 
terminal 107. This code is information which shows the 
subband energy of extended spectrum Sl'(k) and 
information which shows an effective frequency band or 

20 the like. Details thereof will also be described later. 
[0044] Next, details of the processing carried out by 
above described band extension section 104 to extend the 
effective frequency band of first spectrum Sl(k) will 
be explained using FIG.3A and FIG.3B. 

25 [0045] FIG.3A shows first spectrum Sl(k) given from 
frequency domain conversion section 103 and FIG.3B shows 
spectrum SI (k) after an effective frequency band is 
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extended by band extension section 104. Band extension 
section 104 allocates the area in which new spectral 
information can, be inserted in the frequency band where 
frequency k of first spectrum Sl(k) is shown in the range 
5 of Na^ k<Nb. The size of this new area is expressed by 
"Nb-Na" . 

[0046] Here, Nb is determined from the relationship 
between sampling rate Fx of the signal given from outside 
through input terminal 102, analysis length 2 • Na in 
10 frequency domain conversion section 103 and sampling rate 
Fy of the signal decoded by a decoding section (not shown) . 
More specifically, Nb is set by the following expression: 

Nb = Na • — 

i\u i\u ...(Expression 1) 

[0047] Furthermore, sampling rate Fy of the signal 
15 decoded by the decoding section when Nb has been determined 
is determined by the following expression: 

Fy = Fx.™ 

i y 1 * ...(Expression 2) 

[0048] For example, when the coding section is designed 
under a condition of Na = 128, Fx=16 kHz and a decoded signal 

20 of Fy=32 kHz is generated by the decoding section, it 
is necessary to set Nb=12 8 • 32/16 = 256. Therefore, in this 
case, an area of 128=^k<256 is allocated. Furthermore, 
as another example, when the coding section is designed 
under a condition of Na=12 8 , Nb=3 8 4 , Fx=8 kHz, the sampling 

25 rate of the decoded signal generated by the decoding 
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section becomes Fy=8 • 384/128=24 kHz. 

[0049] FIG.4A and FIG.4B illustrate the effect of the 
processing of extending the effective frequency band of 
the spectrum carried out by band extension section 104 
5 in principal. FIG.4A shows the spectrum Sa(k) obtained 
when performing a frequency analysis of the signal of 
sampling rate Fx with an analysis length of 2 • Na . The 
horizontal axis shows a frequency and the vertical axis 
shows spectrum intensity. 
10 [0050] The signal effective frequency band is 0 to Fx/2 
from the Nyquist theorem. The analysis length is 2 • 
Na at this time, and therefore, the range of frequency 
index k is 0 ^ k<Na and the frequency resolution of 
spectrum Sa(k) is Fx/ (2 • Na) . On the other hand, when 
15 spectrum Sb(k) obtained by the frequency analysis with 
an analysis length of 2 • Nb after the same signal is 
upsampled to sampling rate Fy is shown in FIG.4B, the 
signal effective frequency band is extended to 0 to Fy/2 
and the range of frequency index k is 0^ k<Nb. Here, 
20 when Nb satisfies (Expression 1) , frequency resolution 
Fy/(2 • Nb) of spectrum Sb ( k) i s equal t o Fx / ( 2 • Na) . That 
is, spectrum Sa(k) in band 0^k<Na is equal to spectrum 
Sb(k) . Looking from the opposite point of view, this 
means that when the band of spectrum Sa(k) (0±Sk<Na) is 
25 extended to Nb, spectrum Sb(k) matches the spectrum 
obtained by the frequency analysis with the analysis 
length of 2 • Nb after upsampling the signal of sampling 
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Fx to sampling Fy . Using this principle, it is possible 
to obtain a spectrum equivalent to the upsampled signal 
without upsampling in the time domain. 

[0051] In this way, sampling rate conversion section 101 
5 converts the input time domain signal to a frequency 
domain signal and extends the effective frequency band 
of the spectrum obtained, and therefore, it is possible 
to obtain a spectrum equivalent to the spectrum obtained 
by converting the frequency of the signal upsampled in 
10 the time domain. 

[0052] Since the signal output from sampling rate 
conversion section 101 is a signal inthe frequency domain, 
when the signal in the time domain is necessary, it may 
be possible to provide a time domain conversion section 
15 and perform reconversion to the time domain. In above 
described example, sampling rate conversion section 101 
is set inside spectrum coding apparatus 100, and therefore 
the signal is input to spectral information specification 
section 106 as the same frequency domain signal without 
20 being returned to the time domain signal and a code is 
generated . 

[0053] Here, the coding rate of the code output from 
spectral information specification section 106 changes 
by adjusting the selection of the extended spectrum input 
25 to extended spectrum assignment section 105 and the 
specific method of the spectral information by spectral 
information specification section 106. That is, the 
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processing of part in sampling rate conversion section 
101 has a large influence on the coding, too. This means 
that spectrum coding apparatus 100 realizes the 
conversion of the sampling rate and coding of the input 
5 signal at the same time. 

[0054] Here, for simplicity of explanation, the case 
where an extended spectrum is assigned to the original 
spectrum by extended spectrum assignment section 105 has 
been explained as an example, but the processing carried 

10 out by spectral information specification section 106 
is intended to output the information necessary to specify 
an extended spectrum as the code, and it is sufficient 
that at least the extended spectrum to be assigned is 
specified, and therefore the extended spectrum need not 

15 always be actually assigned. 

[0055] Furthermore, upsampling has been explained here 
as an example of the sampling rate conversion but the 
above described principle can also be applied to 
downsampling . 

20 [0056] FIG. 5 is a block diagram showing the main 
configuration of radio transmission apparatus 130 when 
coding apparatus 120 according to this embodiment is 
mounted on the transmitting side of the radio 
communications system . 

25 [0057] This radio transmission apparatus 130 includes 
coding apparatus 120, input apparatus 131, A/D conversion 
apparatus 132, RF modulation apparatus 133 and antenna 




134 . 

[0058] Input apparatus 131 converts sound wave Wll 
audible to human ears to an analog signal which is an 
electric signal and outputs it toA/D conversion apparatus 
5 132. A/D conversion apparatus 132 converts this analog 
signal to a digital signal and outputs it to coding 
apparatus 120 (signal SI) . Coding apparatus 120 encodes 
input digital signal SI, generates a coded signal and 
outputs it to RF modulation apparatus 133 (signal S2). 
10 RF modulation apparatus 133 modulates coded signal S2, 
generates a modulated coded signal and outputs it to 
antenna 134. Antenna 134 transmits the modulated coded 
signal as radio wave W12 . 

[0059] FIG. 6 is a block diagram showing the internal 
15 configuration of above described coding apparatus 120. 

Here, the case where hierarchical coding (scalable 

coding) is performed will be explained as an example. 

[0060] Coding apparatus 120 includes input terminal 121, 

downsampling section 122, first layer coding section 123, 
20 first layer decoding section 124, delay section 126, 

spectrum coding section 100a, multiplexing section 127 

and output terminal 128. 

[0061] Acoustic signal SI of sampling rate Fy is input 
to input terminal 121. Downsampling section 122 applies 
25 downsampling to signal SI input through input terminal 
121 and generates and outputs a signal having a sampling 
rate Fx. First layer coding section 123 encodes this 




downsampled signal and outputs the code obtained to 
multiplexing section (multiplexer) 127 and also outputs 
it to first layer decoding section 124. First layer 
decoding section 124 generates a decoded signal of the 
5 first layer based on this code. 

[0062] On the other hand, delay section 126 gives a delay 
of a predetermined length to signal SI input through input 
terminal 121. Suppose the magnitude of this delay has 
the same value as a time delay generated when the signal 
10 has passed through downsampling section 122, first layer 
coding section 123 and first layer decoding section 124. 
Spectrum coding section 100a performs spectrum coding 
using signal S3 having a sampling rate Fx output from 
first layer decoding section 124 and signal S4 having 
15 a sampling rate Fy output from delay section 126 and outputs 
generated code S5 to multiplexing section 127. 
Multiplexing section 127 multiplexes the code obtained 
by first layer coding section 123 with code S5 obtained 
by spectrum coding section 100a and outputs the 
20 multiplexed signal as output code S2 through output 
terminal 128. This output code S2 is given to RF 
modulation apparatus 133. 

[0063] FIG. 7 is a block diagram showing the internal 
configuration of above described spectrum coding section 
25 100a. This spectrum coding section 100a has a basic 
configuration similar to that of spectrum coding 
apparatus 100 shown in FIG. 2, and therefore the same 



components are assigned the same reference numerals and 
explanations thereof will be omitted. 

[0064] A feature of spectrum coding section 100a is to 
give extended spectrum SI' (k) (Na ^ k<Nb) using the 
5 spectrum of input signal S3 having sampling rate Fy . 
According to this, since a target signal to determine 
extended spectrum SI' (k) is given, and therefore the 
accuracy of extended spectrum Sl'(k) improves and as a 
result, the effect of leading to quality improvement is 
10 obtained. 

[0065] Frequency domain conversion section 112 performs 
a frequency analysis of signal S4 of the sampling rate 
Fy input through input terminal 111 with analysis length 
2 * Nb and obtains second spectrum S2 (k) (0^k<Nb) . Here, 
15 suppose that the relationship shown in (Expression 1) 
holds between sampling frequencies Fx, Fy and analysis 
lengths Na, Nb . 

[0066] Spectral information specification section 106 
determines the code which shows extended spectrum SI ' (k) . 

20 Here, extended spectrum SI ■ (k) is determined using second 
spectrum S2(k) obtained by frequency domain conversion 
section 112 . Spectral information specification section 
106 determines a code in two steps; a step of determining 
the shape of extended spectrum SI" (k) and a step of 

25 determining the gain of extended spectrum Sl'(k). 

[0067] The step of determining the shape of extended 
spectrum Sl'(k) will be explained below first. 



[0068] In this step, extended spectrum Sl'(k) is 
determined using the band 0^k<Na of first spectrum SI ( k) . 
As the specif ic method thereof , first spectrum SI ( k) which 
is separated by a certain fixed value C on the frequency 
5 axis as shown in the following expression is copied to 
extended spectrum SI 1 (k) . 

Sl'(k) =Sl(k-C) (Na<k<Nb) ... (Expression 3 ) 
[0069] Here, C is a predetermined fixed value and needs 
to satisfy the condition of C^Na. According to this 

10 method, the information indicating the shape of extended 
spectrum SI' (k) is not output as the code. 
[0070] As another method, instead of above described 
fixed value C, it may be also possible to use variable 
T which takes a value in a certain predetermined range 

15 T M i N to T M ax and output value T 1 of variable T when the 
shape of extended spectrum SI 1 (k) is most similar to that 
of second spectrum S2(k) as part of the code. At this 
time, extended spectrum SI' (k) is shown by the following 
expression : 

20 Sl'(k)=Sl{k~r) (Na<k<Nb) ...(Expression 4) 

[0071] Next, thestepof determining the gain of extended 
spectrum Sl'(k) obtained by spectrum information 
specification section 106 will be explained below. 
[0072 ] The gain of extended spectrum S 1 1 (k) is determined 

25 so as to match the power in the band Na^k<Nb of second 
spectrum S2(k). More specifically, according to the 
following expression, deviation V of the power is 
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calculated, and an index obtained by quantizing this value 
is output as the code through output terminal 107. 



V = 



Nb-l 



k=Na 



Nb-\ 



(Expression 5) 



k=Na 



[0073] Furthermore, it may be also possible to adopt a 
mode in which extended spectrum SI 1 (k) is divided into 
a plurality of subbands and determine a code independently 
for each subband. In such a case, in the step of 
determining the shape of extended spectrum SI 1 (k) , it 
is possible to determine T 1 expressed by (Expression 4) 
for each subband and output it as the code and determine 
only one common T 1 and output it as the code. Then, in 
the step of determining the gain of extended spectrum 
SI' (JO, deviation V(j) of the power is calculated for 
each subband and an index obtained by quantizing this 
value is output as the code through output terminal 107. 
The amount of variation of the power for each subband 
is expressed by the following expression: 



VU) = 



BHU) 
k=BL(j) 



BH(j) 
k=BLU) 



(Expression 6) 



[0074 ] where, j denotes a subband number and BL ( j ) 
20 denotes a frequency index corresponding to the minimum 
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frequency of the jth subband, BH ( j ) denotes a frequency 
index corresponding to the maximum frequency of the jth 
subband. By adopting the configuration in which a code 
is output for each subband in this way, it is possible 
5 to realize the scalable function. 

[ 0 07 5 ] Apart from the mode in which second spectrum S2 ( k) 
is calculated as shown in FIG. 7, it is also possible to 
adopt a mode (spectrum coding section 100b) in which the 
.signal of sampling rate Fy is LPC-analyzed as shown in 

10 FIG. 8. That is, it is also possible to LPC-analyze the 
signal of sampling rate Fy, obtain an LPC coefficient 
and determine extended spectrum SI 1 (k) using this LPC 
coefficient. In this configuration, it is possible to 
apply a DFT to the LPC coefficient and convert it to 

15 spectral information and determine extended spectrum 
Sl f (k) using this spectrum. 

[0076] In this way, according to the coding apparatus 
of this Embodiment, it is possible to reduce the circuit 
scale of the coding apparatus and also reduce the amount 
20 of coding processing calculation. 

[0077] In addition to the above described effect, the 
following effect is obtained when the coding apparatus 
of this Embodiment is applied to scalable coding. 

[0078 ] As in the case of the conventional art, when the 
25 sampling rate is converted in the time domain, the input 
signal needs to be passed through a low pass filter 

(hereinafter referred to as "LPF") to avoid aliasing. 

25 



Generally, when filtering processing is performed in the 
time domain, a time delay occurs in the output signal 
with respect to the input signal . When an FIR -type filter 
is applied to the LPF, the filter order must be increased 
5 to make its cutoff characteristic steep, which produces 
not only a substantial increase of the amount of 
calculation but also a time delay equivalent to the half 
of sample numbers of the filter order. 

[0079] For example, when a 256th-order filter is applied 
10 to a signal having a sampling frequency Fs = 24 kHz, a delay 
equal to or greater than 5 ms is produced by only a sampling 
rate conversion. The occurrence of such a delay, when 
the 256th-order filter is applied to a bidirectional 
speech communication, causes a problem because the 
15 reaction of the other side of communication is perceived 
as if it becomes slower. 

[0080] Furthermore, when using an IIR-type filter for 
the LPF, the cutoff characteristic can be made steeper 
even if the order is reduced comparatively and the delay 

20 never becomes as big as that of the FIR-type filter. 
However, in the case of using the IIR-type filter, it 
is not possible to design such a filter that the amount 
of delay which occurs in all the frequencies like the 
FIR-type filter becomes constant. In scalable coding, 

25 when a signal after the sampling rate conversion is 
subtracted from the input signal during the scalable 
coding, it is necessary to give a predetermined delay 

26 




amount to the input signal according to the time delay 
of the signal after the sampling rate conversion. However, 
when an IIR-type LPF is used, the amount of delay with 
respect to the frequency is not constant, and therefore 
5 the problem that the subtraction processing cannot be 
performed accurately occurs . 

[ 0081 ] The coding apparatus of this embodiment can solve 
these problems which occur during scalable coding. 
[0082] FIG. 9 is a block diagram showing the main 
10 configuration of radio reception apparatus 180 which 
receives a signal transmitted from radio transmission 
apparatus 13 0. 

[0083] This radio reception apparatus 180 is provided 
with antenna 181, RF demodulation apparatus 182, decoding 
15 apparatus 170, D/A conversion apparatus 183 and output 
apparatus 18 4. 

[0084] Antenna 181 receives a digital coded acoustic 
signal as radio wave W12, generates a digital received 
coded acoustic signal which is an electric signal and 
20 gives it to RF demodulation apparatus 182. RF 
demodulation apparatus 182 demodulates the received coded 
acoustic signal from antenna 181, generates a demodulated 
coded acoustic signal Sll and gives it to decoding 
apparatus 17 0. 

25 [0085] Decoding apparatus 170 receives digital 

demodulated coded acoustic signal Sll from RF 
demodulation apparatus 182, performs decoding processing, 
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generates digital decoded acoustic signal S12 and gives 
it to D/A conversion apparatus 183. D/A conversion 
apparatus 183 converts digital decoded acoustic signal 
S12 from decoding apparatus 170, generates an analog 
5 decoded voice signal and gives it to output apparatus 
184. Output apparatus 184 converts the analog decoded 
voice signal which is an electric signal to vibration 
of the air and outputs it as sound wave W13 audible to 
human ears . 

10 [0086] FIG. 10 is a block diagram showing the internal 
configuration of above described decoding apparatus 170. 
Also here, a case where a signal generated by hierarchical 
coding is decoded will be explained as an example. 
[0087] This decoding apparatus 170 is providedwith input 

15 terminal 171, separation section 172, first layer 
decoding section 173, spectrum decoding section 150 and 
output terminal 176. 

[008 8 ] Code Sll generated by hierarchical coding is input 
from RF demodulation apparatus 182 to input terminal 17 1 . 

20 Separation section 172 separates demodulated coded 
acoustic signal Sll input through input terminal 171 and 
generates a code for first layer decoding section 173 
and a code for spectrum decoding section 150. First layer 
decoding section 173 decodes the decoded signal of 

25 sampling rate Fx using the code obtained from separation 
section 172 and gives this decoded signal S13 to spectrum 
decoding section 150. Spectrum decoding section 150 
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performs spectrum decoding which will be described later 
on code S14 separated by separation section 172 and signal 
S13 of sampling rate Fx generated by first layer decoding 
section 173, generates decoded signal S12 of sampling 
5 rate Fy and outputs this through output terminal 176. 
[0089] FIG. 11 is a block diagram showing the internal 
configuration of above described spectrum decoding 
section 15 0. 

[0090] This spectrum decoding section 150 includes input 
10 terminals 152, 153, frequency domain conversion section 
154, band extension section 155, decoding section 156, 
combining section 157, time domain conversion section 
158 and output terminal 159. 

[0091] Signal S13 sampled at sampling rate Fx is input 
15 to input terminal 152. Furthermore, code S14 related to 
extended spectrum SI 1 (k) is input to input terminal 153 . 
[0092] Frequency domain conversion section 154 performs 
a frequency analysis of time domain signal S13 input from 
input terminal 152 with an analysis length of 2 • Na and 
20 calculates first spectrum Sl(k) . A modified discrete 
cosine transform (MDCT) is used as the frequency analysis 
method. The MDCT is characterized in that an analysis 
frame and a successive frame are overlapped by half on 
top one another and analysis is performed, and thereby 
25 distortion between the frames is canceled using an 
orthogonal basis whereby the first half portion of the 
analysis frame becomes an odd function and the second 
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half portion of the analysis frame becomes an even function. 
First spectrum Sl(k) obtained in this way is given to 
band extension section 155. As the frequency analysis 
method, a discrete Fourier transform ( D FT ) , discrete 
5 cosine transform (DCT) or the like can also be used. 
[0093] Band extension section 155 allocates an area so 
that a new spectrum can be assigned to the extended area 
following to the frequency k=Na of input first spectrum 
Sl(k) and ensures that the band of first spectrum Sl(k) 

10 become 0 ^ k<Nb . First spectrum SI ( k ) whose band has been 
extended is output to combining section 157. 

[0094] On the other hand, decoding section 156 decodes 
code S14 related to extended spectrum SI 1 (k) input through 
input terminal 153, obtains extended spectrum SI' (k) and 

15 outputs it to combining section 157. 

[0095] Combining section 157 combines first spectrum 
SI (k) given from band extension section 155 and extended 
spectrum SI' (k) . This combination is realized by 
inserting extended spectrum Sl f (k) in the band Na^ k<Nb 

20 of first spectrum Sl(k) . First spectrum Sl(k) obtained 
through this processing is output to time domain 
conversion section 158. 

[0096] Time domain conversion section 158 applies time 
domain conversion processing which is equivalent to the 
25 inverse conversion of the frequency domain conversion 
carried out by spectrum coding section 100a and generates 
signal S12 in the time domain through a multiplication 



of an appropriate window function and a overlap-add 
processing. Signal S12 in the time domain generated in 
this way is output as the decoded signal through output 
terminal 15 9. 

5 [0097] Next, the processing to be carried out by band 
extension section 155 will be explained using FIG.12A 
and FIG.12B. 

[0098] FIG.12A shows first spectrum Sl(k) given from 
frequency domain conversion section 154. FIG.12B shows 

10 the spectrum obtained as a result of the processing of 
band extension section 155 and an area in which new spectral 
information can be stored is allocated in the band in 
which frequency k is expressed in the range of Na^ k<Nb. 
The size of this newarea is expressedby Nb-Na. Nbdepends 

15 on the relationship among sampling rate Fx of the signal 
given from input terminal 152, analysis length 2 • Na of 
frequency domain conversion section 154 and sampling rate 
Fy of the signal decoded by spectrum decoding section 
150, and it is possible to set Nb according to the following 

20 expression: 

Nb = Na- 

i\u ivu ...(Expression 7) 

[0099] Also, when Nb is determined, sampling rate Fy of 
the signal decoded by spectrum decoding section 150 is 
determined by the following expression: 

25 Fy = Fx ...(Expression 8) 
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[0100] For example, when a decoded signal having a 
sampling rate of Fy=32 kHz is generated by spectrum 
decoding section 150 under the condition where the 
sampling rate of the input signal is Fx=16 kHz and the 
analysis length of frequency domain conversion section 
154 is Na=128, it is necessary to set Nb=128 • 32/16 = 256 
at band extension section 155. Therefore, in this case, 
band extension section 155 allocates the area of 128 
=sk<256. In another example, when the sampling rate of 
the input signal is Fx=8 kHz, the analysis length of 
frequency domain conversion section 154 is Na=128 and 
the amount of extension of band extension section 155 
is Nb=384, the sampling rate of the decoded signal 
generated at spectrum decoding section 150 is Fy=8 • 
384/128=24 kHz. 

[0101] FIG. 13 shows how a decoded signal is generated 
through the processing of combining section 157 and time 
domain conversion section 158. 

[0102] Combining section 157 inserts extended spectrum 
SI 1 (k) (Na = k<Nb) in the band of Na^ k<Nb of first spectrum 
Sl(k) where a band has been extended and sends combined 
first spectrum Sl(k) (0=^k<Nb) obtained by insertion to 
time domain conversion section 158. Time domain 
conversion section 158 generates a decoded signal in the 
time domain and this allows a decoded signal having a 
sampling rate of FS ( = Fx • Nb/Na) . 

[0103] In this way, the decoding apparatus according to 
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this embodiment can decode a signal coded by the coding 
apparatus according to this embodiment. 

[0104] Here, the case where the coding apparatus or the 
decoding apparatus according to this embodiment is 
5 applied to a radio communications system has been 
explained as an example, but the coding apparatus or the 
decoding apparatus according to this embodiment can also 
be applied to a wired communications system as shown below. 
[0105] FIG.14A is a block diagram showing the main 

10 configuration of the transmitting side when the coding 
apparatus according to this embodiment is applied to a 
wired communications system. The same components as 
those shown in FIG. 5 are assigned the same reference 
numerals and explanations thereof will be omitted. 

15 [0106] Wired transmission apparatus 140 includes coding 
apparatus 120, input apparatus 131 and A/D conversion 
apparatus 132 and the output thereof is connected to 
network N 1 . 

[0107] The input terminal of A/D conversion apparatus 
20 132 is connected to the output terminal of input apparatus 
131. The input terminal of coding apparatus 120 is 
connected to the output terminal of A/D conversion 
apparatus 132. The output terminal of coding apparatus 
120 is connected to network Nl . 
25 [0108] Input apparatus 131 converts sound wave Wll 
audible to human ears to an analog signal which is an 
electric signal and gives it to A/D conversion apparatus 
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132. A/D conversion apparatus 132 converts an analog 
signal to a digital signal and gives it to coding apparatus 
120. Coding apparatus 120 encodes an input digital signal, 
generates a code and outputs it to network Nl. 
5 [0109] FIG.14B is a block diagram showing the main 
configuration of the receiving side when the decoding 
apparatus according to this embodiment is applied to a 
wired communications system. The same components as 
those shown in FIG. 9 are assigned the same reference 
10 numerals and explanations thereof will be omitted. 

[0110] Wired reception apparatus 190 includes reception 
apparatus 191 connected to network Nl, decoding apparatus 
170, D/A conversion apparatus 183 and output apparatus 
184 . 

15 [0111] The input terminal of reception apparatus 191 is 
connected to network Nl . The input terminal of decoding 
apparatus 170 is connected to the output terminal of 
reception apparatus 191. The input terminal of D/A 
conversion apparatus 183 is connected to the output 

20 terminal of decoding apparatus 170. The input terminal 
of output apparatus 184 is connected to the output terminal 
of D/A conversion apparatus 183. 

[0112] Reception apparatus 191 receives a digital coded 
acoustic signal from network Nl, generates a digital 
25 received acoustic signal and gives it to decoding 
apparatus 170. Decoding apparatus 170 receives the 
received acoustic signal from reception apparatus 191, 
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carries out decoding processing on this received acoustic 
signal, generates a digital decoded acoustic signal and 
gives it to D/A conversion apparatus 183. D/A conversion 
apparatus 183 converts the digital decoded voice signal 
5 from decoding apparatus 170, generates an analog decoded 
voice signal and gives it to output apparatus 184 . Output 
apparatus 184 converts the analog decoded acoustic signal 
which is an electric signal to vibration of the air and 
outputs it as sound wave W13 audible to human ears. 
10 [0113] In this way, according to the above described 
configuration, it is possible to provide a wired 
transmission/reception apparatus having operations and 
effects similar to those of the above described 
transmission/reception apparatus. 

15 

[0114] (Embodiment 2) 

FIG. 15 is a block diagram showing the main 
configuration of decoding apparatus 270 according to 
Embodiment 2 of the present invention. This decoding 

20 apparatus 270 has a basic configuration similar to that 
of decoding apparatus 170 shown in FIG. 10, and therefore 
the same components are assigned the same reference 
numerals and explanations thereof will be omitted. 
[0115] A feature of this embodiment is to generate a 

25 decoded signal having a desired sampling rate by 
correcting maximum frequency index Nb of first spectrum 
Sl(k) (0^ k<Nb) after combination processing to desired 
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value Nc . 

[0116] Spectrum decoding section 250 carries out 
spectrum decoding using code S14 separated by separation 
section 172, signal S13 of sampling rate Fx generated 
5 by first layer decoding section 173 and coefficient Nc 

(signal S21) input through input terminal 271. Spectrum 
decoding section 250 then outputs the decoded signal of 
sampling rate Fy obtained through output terminal 176. 
When the analysis length of frequency domain conversion 
10 of spectrum decoding section 250 is 2 • Na, sampling rate 
Fy of the decoded signal is expressed Fy = Fx • Nc /Na . 

[0117] FIG. 16 is a block diagram showing the internal 
configuration of above described spectrum decoding 
section 2 50. 

15 [0118] Coefficient Nc input through input terminal 271 
is given to correction section 251 and time domain 
conversion section 158a. 

[0119] Correction section 251 corrects the effective 
band of first spectrum SI ( k) (0±Sk<Nb) given from combining 
20 section 157 to 0^k<Nc based on coefficient Nc (signal 
S21) given through input terminal 271. Correction 
sect ion 2 51 then gives fir st spectrum S 1 ( k ) (0^k<Nc) after 
the band correction to time domain conversion section 
158a . 

25 [0120] Time domain conversion section 158a applies 
conversion processing to first spectrum Sl(k) (0^k<Nc) 
given from correction section 251 under an analysis length 
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of 2 • Nc according to coefficient Nc given through input 
terminal 271, performs a multiplication with an 
appropriate window function and a over lap -add processing, 
generates a signal inthe time domain and outputs it through 
5 output terminal 159. The sampling rate of this decoded 
signal becomes FS=Fx • Nc/Na. 

[0121] FIG. 17 and FIG. 18 are diagram illustrating 
processing by correction section 251 in more detail. 
[0122] FIG. 17 shows processing by correction section 251 

10 when Nc<Nb . The band of first spectrum SI ( k) (signal S2 1 ) 
given from combining section 157 is 0 ^ k<Nb . Therefore, 
correction section 251 deletes a spectrum in the range 
of Nc^k<Nb so that the band of this first spectrum Sl(k) 
becomes 0^ k<Nc. As a result, first spectrum Sl(k) (0 

15 ^k<Nc) (signal S22) obtained is given to time domain 
conversion section 158a and decoded signal S23 in the 
time domain is generated. The sampling rate of this 
decoded signal S23 becomes FS=Fx • Nc/Na. 

[0123] FIG. 18 also shows processing by correction 
20 section 251, but in this case NONb. The band of first 
spectrum SI (k) (signal S25) given from combining section 
251 is 0 k<Nb as in the case of FIG. 17. Correction 
section 251 extends the band of Nb^ k<Nc so that the band 
of this first spectrum SI (k) becomes 0^k<Nc and assigns 
25 a specific value (e.g. zero) to the area. As a result, 
first spectrum Sl(k) (0^k<Nc) (signal S26) is given to 
time domain conversion section 158a and decoded signal 
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S27 in the time domain is generated. The sampling rate 
of this decoded signal S27 becomes FS = Fx • Nc/Na. 
[0124] The operation of spectrum decoding section 250 
will be further explained using FIG. 19, FIG.20A and 
5 FIG.20B. 

[0125] First, suppose that the code input through input 
terminal 153 changes from one frame to another. That is, 
suppose that there are three bands in the band from 
combining section 157 as shown in FIG. 19; 0^k<Na (band 
10 Rl), 0^k<Nbl (band R2 ) , 0^k<Nb2 (band R3) (note that 
Na<Nbl<Nb2) and one of these bands is selected for each 
frame . 

[0126] FIG.20A illustrates the operation of the spectrum 
decoding section 250 when coefficient Nc is equal to Nb2, 

15 and FIG.20B illustrates the operation of spectrum 
decoding section 250 when coefficient Nc is equal to Nbl. 
[0127] These figures express that the band of the 
spectrum obtained in the i-th frame is any one of Rl, 
R2, R3 . Furthermore, processing 1 shows the processing 

20 of inserting a zero value in the band of Nbl^k<Nb2, 
processing 2 shows the processing of inserting a zero 
value in the band of Na^k<Nb2, processing 3 shows the 
processing of deleting the band of Nbl ^ k<Nb2 and 
processing 4 shows the processing of inserting a zero 

25 value in the band of Na^k<Nbl. 

[0128] First, the case of FIG.20A will be explained. 
[0129] In this figure, in the 0th frame to the 1st frame 
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and the 7th frame to the 8th frame, since the band of 
the spectrum is R3, that is, the band of first spectrum 
Sl(k) is 0±Sk<Nb2, and therefore correction section 251 
outputs first spectrum Sl(k) (0^k<Nb2) to time domain 
5 conversion section 158a without applying any processing. 
[0130] Furthermore, in the 2nd frame to the 4th frame 
and the 9th frame, since the band of the spectrum is R2 , 
that is, the band of first spectrum Sl(k) is 0^k<Nbl, 
correction section 251 extends the band of first spectrum 
10 Sl(k) to Nb2, inserts a zero value in the band of Nbl 
^k<Nb2 and then outputs first spectrum Sl(k) (0^k<Nb2) 
to time domain conversion section 158a. 

[0131] On the other hand, the band of the spectrum is 
Rl in the 5th frame to the 6th frame, that is, the band 

15 of first spectrum Sl(k) is 0 ^ k<Na, and therefore 
correction section 251 extends the band of first spectrum 
Sl(k) to Nb2, inserts a zero value in the range of Na 
^ k<Nb2 and then outputs first spectrum Sl(k) (0^k<Nb2) 
to time domain conversion section 158a. 

20 [0132] Next, the case of FIG.20B will be explained. 

[0133] In this figure, in the 2nd frame to the 4th frame 
and the 9th frame, the band of the spectrum is R2 , that 
is, the band of first spectrum Sl(k) is 0^k<Nbl, and 
therefore correction section 251 outputs first spectrum 

25 Sl(k) (0^k<Nbl) to time domain conversion section 158a 
without applying any processing. 

[0134] Furthermore, in the 0th frame to the 1st frame, 
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and the 7th frame to the 8th frame, the band of the spectrum 
is R3, that is, the band of first spectrum Sl(k) is 0 
^k<Nb2, correction section 251 deletes the band of Nbl 
^5 k<Nb2 , and then outputs first spectrum Sl(k) ( 0 ^ k<Nbl ) 
5 to time domain conversion section 158a. 

[0135] On the other hand, in the 5th frame to the 6th 
frame, the band of the spectrum is Rl, that is, the band 
of first spectrum Sl(k) is 0 ^ k<Na, and therefore 
correction section 251 extends the band of first spectrum 
10 Sl(k) to Nbl, inserts a zero value in the band of Na 
^ k<Nbl , and then outputs first spectrum Sl(k) ( 0 ^ k<Nbl ) 
to time domain conversion section 158a. 

[0136] According to the this embodiment, even when the 
effective frequency band of received first spectrum SI (k) 
15 changes temporally, appropriate coefficient Nc is given 
in this way, and it is possible to thereby obtain a decoded 
signal at a desired sampling rate stably. 

[0137] (Embodiment 3) 
20 FIG. 21 shows the main configuration of a 

communications system according to of Embodiment 3 of 
the present invention. 

[0138] A feature of this embodiment is to deal with a 
case where the effective frequency band of first spectrum 
25 Sl(k) received on the receiving side changes temporally 
depending on the condition of the communication network 
(communication environment) . 
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[0139] Hierarchical coding section 301 applies the 
hierarchical coding processing shown in Embodiment 1 to 
the input signal of sampling rate Fy and generates a 
scalable code. Here, suppose the generated code is made 
5 up of information (R31) on band 0 ^ k<Ne , information (R32) 
on band Ne^k<Nf and information (R33) on band Nf ^ k<Ng. 
Hierarchical coding section 301 gives this code to network 
control section 302. 

[0140] Network control section 302 transfers a code given 

10 to from hierarchical coding section 301 to hierarchical 
decoding section 303. Here, network control section 302 
discards part of the code to be transferred to hierarchical 
decoding section 303 according to the condition of the 
network. For this reason, the code to be input to 

15 hierarchical decoding section 303 is any one of the code 
made up of information R31 to R33 when there is no code 
to be discarded, the code made up of information R31 and 
R32 when the code of information R33 is discarded and 
the code made up of information R31 when the code of 

20 information R32 and R33 is discarded. 

[0141] Hierarchical decoding section 303 applies the 
hierarchical decoding method shown in Embodiment 1 or 
Embodiment 2 toagivencode andgenerates a decodedsignal. 
When Embodiment 1 is applied to hierarchical decoding 

25 section 303, sampling rate Fz of the output decoded signal 
becomes Fy (because Fz = Fy • Ng/Ng) . Furthermore, when 
Embodiment 2 is applied to hierarchical decoding section 
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303, it is possible to set the sampling rate of the decoded 
signal according to desired coefficient Nc, and sampling 
rate Fz of the decoded signal becomes Fy • Nc/Ng. 
[0142] In this way, according to the this embodiment, 
5 even when the effective frequency band of first spectrum 
Sl(k) received on the receiving' s ide changes temporally 
depending on the condition of the communication network, 
the receiving side can obtain the decoded signal of a 
desired sampling rate stably. 

10 

[0143] (Embodiment 4) 

FIG. 22 shows the main configuration of a 
communications system according to Embodiment 4 of the 
present invention. 

15 [0144] A feature of this embodiment is that even when 
one code generated by one hierarchical coding section 
is simultaneously transmitted to plural hierarchical 
decoding sections having different decodable sampling 
rates (different decoding capacities), the receiving side 

20 can handle the code and obtain decoded signals having 
different sampling rates. 

[0145] Hierarchical coding section 401 applies the 
coding processing shown in Embodiment 1 to the input signal 
of sampling rate Fy and generates a scalable code. Here, 
25 suppose the generated code is made up of information (R41) 
on band 0^k<Nh, information (R42) on band Nh^k<Ni and 
information (R43) on band Ni^k<Nj . Hierarchical coding 
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section 401 gives this code to first hierarchical decoding 
section 402-1, secondhierarchical decoding section 402-2 
and third hierarchical decoding section 402-3 
respectively . 

5 [0146] First hierarchical decoding section 402-1, 
second hierarchical decoding section 402-2 and third 
hierarchical decoding section 402-3 apply the 
hierarchical decoding method shown in Embodiment 1 or 
Embodiment 2 to a given code and generate a decoded signal . 

10 First hierarchical decoding section 402-1 performs 
decoding processing when coefficient Nc=Nj , second 
hierarchical decoding section 402-2 performs decoding 
processing of when coefficient Nc=Ni and third 
hierarchical decoding section 402-3 performs decoding 

15 processing of when coefficient Nc=Nh . 

[0147] First hierarchical decoding section 402-1 
performs decoding processing of when coefficient N c = N j 
and generates a decoded signal. Sampling rate Fl of this 
decoded signal becomes Fy (because F 1 = Fy • N j /N j ) . 

20 [0148] Second hierarchical decoding section 402-2 
performs decoding processing of when coefficient Nc=Ni 
and generates a decoded signal. Sampling rate F2 of this 
decoded signal becomes Fy • Ni/Nj . 

[0149] Third hierarchical decoding section 402-3 
25 performs decoding processing of when coefficient Nc=Nh 
and generates a decoded signal. Sampling rate F3 of this 
decoded signal becomes Fy • Nh/Nj . 
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[0150] In this way, according to this embodiment, the 
transmitting side can transmit a code without considering 
the decoding capacity on the receiving side, and therefore 
it is possible to suppress the load of a communication 
5 network. Furthermore, decoded signals having plural 
types of sampling rates can be generated in a simple 
configuration and with a smaller amount of calculation. 
[0151] The coding apparatus or the decoding apparatus 
according to the present invention can also be mounted 

10 on a communication terminal apparatus and a base station 
apparatus in a mobile communications system, and it is 
possible to thereby provide a communication terminal 
apparatus and a base station apparatus having operations 
and effects similar to those described above. 

15 [0152] Here, the case where the present invention is 
constructed by hardware has been explained as an example 
but the present invention can also be realized by software. 
[0153] The present application is based on Japanese 
Patent Application No. 2003-341717 filed on September 30, 

20 2003, entire content of which is expressly incorporated 
by reference herein. 

Indus trial Applicabi 1 i ty 

[0154] The coding apparatus and the decoding apparatus 
25 according to the present invention have the effect of 
realizing scalable coding in a simple configuration and 
with a small amount of calculation and are suitable for 



44 



use in a communications system such as an IP network. 
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